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Abstract 

Index Coding has received considerable attention recently motivated in part by applications 
t-H such as fast video-on-demand and efficient communication in wireless networks and in part by its 

connection to Network Coding. Optimal encoding schemes and efficient heuristics were studied 
in various settings, while also leading to new results for Network Coding such as improved gaps 
between linear and non-linear capacity as well as hardness of approximation. The basic setting of 
Index Coding encodes the side-information relation, the problem input, as an undirected graph 
and the fundamental parameter is the broadcast rate j3, the average communication cost per 
bit for sufficiently long messages (i.e. the non-linear vector capacity). Recent nontrivial bounds 
on P were derived from the study of other Index Coding capacities (e.g. the scalar capacity 0i) 
by Bar-Yossef et al (2006), Lubetzky and Stav (2007) and Alon et al (2008). However, these 
indirect bounds shed little light on the behavior of f3: there was no known polynomial-time 
algorithm for approximating (3 in a general network to within a nontrivial (i.e. o(n)) factor, and 
the exact value of /3 remained unknown for any graph where Index Coding is nontrivial. 
1—1 Our main contribution is a direct information-theoretic analysis of the broadcast rate j3 

using linear programs, in contrast to previous approaches that compared /3 with graph-theoretic 
parameters. This allows us to resolve the aforementioned two open questions. We provide a 
ON polynomial-time algorithm with a nontrivial approximation ratio for computing /? in a general 

network along with a polynomial-time decision procedure for recognizing instances with /3 = 2. 
In addition, we pinpoint /3 precisely for various classes of graphs (e.g. for various Cayley graphs of 
• cyclic groups) thereby simultaneously improving the previously known upper and lower bounds 

for these graphs. Via this approach we construct graphs where the difference between /3 and its 
trivial lower bound is linear in the number of vertices and ones where j3 is uniformly bounded 
while its upper bound derived from the naive encoding scheme is polynomially worse. 

> 

1 Introduction 

In the Index Coding problem a server holds a set of messages that it wishes to broadcast over 
a noiseless channel to a set of receivers. Each receiver is interested in one of the messages and 
has side-information comprising some subset of the other messages. Given the side-information 
map as an input, the objective is to devise an optimal encoding scheme for the messages (e.g., one 
minimizing the broadcast length) that allows all the receivers to retrieve their required information. 

This notion of source coding that optimizes the encoding scheme given the side-information 
map of the clients was introduced by Birk and Kol [8j and further developed by Bar-Yossef et al. 
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in (7]. Motivating applications include satellite transmission of large files (e.g. video on demand), 
where a slow uplink may be used to inform the server of the side-information map, namely the 
identities of the files currently stored at each client due to past transmissions. The goal of the 
server is then to issue a shortest possible broadcast that allows every client to decode its target file 
while minimizing the overall latency. See [7J|8j|TT| and the references therein for further applications 
of the model and an account of various heuristic/rigorous Index Coding protocols. 

The basic setting of the problem (see [4]) is formalized as follows: the server holds n messages 
xi, . . . , x n E X where |S| > 1, and there are m receivers R\, . . . , R m . Receiver Rj is interested in 
one message, denoted by and knows some subset N(j) of the other messages. A solution of 

the problem must specify a finite alphabet £p to be used by the server, and an encoding scheme 
£ : S n — > Sp such that, for any possible values of x\, . . . , x n , every receiver Rj is able to decode the 
message from the value of £(x\, . . . ,x n ) together with that receiver's side-information. The 
minimum encoding length i = |~log 2 |Sp|] for messages that are t bits long (i.e. |S| = 2*) is denoted 
by f3t(G), where G refers to the data specifying the communication requirements, i.e. the functions 



f(j) and N(j). As noted in 23 , due to the overhead associated with relaying the side-information 



map to the server the main focus is on the case t > 1 and namely on the following broadcast rate. 

P (G)± lim &M = in f&M (1.1) 

t— >oo t t t 

(The limit exists by sub-additivity.) This is interpreted as the average asymptotic number of 
broadcast bits needed per bit of input, that is, the asymptotic broadcast rate for long messages. In 
Network Coding terms, (3 is the vector capacity whereas (3\ is a scalar capacity. 

An important special case of the problem arises when there is exactly one receiver for each 
message, i.e. m = n and f(j) = j for all j. In this case, the side-information map N(j) can 
equivalently be described in terms of the binary relation consisting of pairs (i, j) such that Xj £ 
These pairs can be thought of as the edges of a directed graph on the vertex set [n] or, in case 
the relation is symmetric, as the edges of an undirected graph. This special case of the problem 
(which we will hereafter identify by stating that G is a graph) corresponds to the original Index 
Coding problem introduced by Birk and Kol (§], and has been extensively studied due to its rich 
connections with graph theory and Ramsey theory. These connections stem from simple relations 
between broadcast rates and other graph-theoretic parameters. Letting a(G),x(G) denote the 
independence and clique-cover numbers of G, respectively, one has 

a(G)</3(G)</3i(G)<x(G). (1.2) 

The first inequality above is due to an independent set being identified with a set of receivers with 
no mutual information, whereas the last one due to |7,8| is obtained by broadcasting the bitwise 
XOR of the vertices per clique in the optimal clique-cover of G. 

1.1 History of the problem 

The framework of graph Index Coding and its scalar capacity f3\ were introduced in j8j, where 
Reed-Solomon based protocols hinging on a greedy clique-cover (related to the bound /3i < x) 
were proposed and empirically analyzed. In a breakthrough paper ff\, Bar-Yossef et al. proposed a 
new class of linear index codes based on a matrix rank minimization problem. The solution to this 
problem, denoted by mimk^G), was shown to achieve the optimal linear scalar capacity over GF(2) 
and in particular to be superior to the clique-cover method, i.e. f3\ < minrk2 < x- The parameter 



2 



minrk2 was extended to general fields in |23| , where arguments from Ramsey Theory showed that 
for any e > there is a family of graphs on n vertices where /3i < n 6 while minrk2 > n l ~ £ for 
any fixed e > 0. The first proof of a separation /3 < f}\ for graphs was presented by Alon et al. 
in the proof introduces a new capacity parameter (3* such that (3 < [3* < Pi and shows that 
the second inequality can be strict using a graph-theoretic characterization of (3*. In addition, 
the paper studied hypergraph Index Coding (i.e. the general broadcasting with side information 
problem, as defined above) , for which several hard instances were constructed — ones where (3 = 2 
while f3* is unbounded and others where (3* < 3 while (3\ is unbounded. The first proof of a 
separation a < (3 for graphs is presented in a companion paper [9j; the proof makes use of a new 
technique for bounding (3 from below using a linear program whose constraints express information 
inequalities. The paper then uses lexicographic products to amplify this separation, yielding a 
sequence of graphs in which the ratio j3/a tends to infinity. The same technique of combining 
linear programs with lexicographic products also leads to an unbounded multiplicative separation 
between non-linear and vector- linear Index Coding in hypergraphs. 

As is clear from the foregoing discussion, the prior work on Index Coding has been highly 
successful in bounding the broadcast rate above and below by various parameters (all of which 
are, unfortunately, NP-hard to compute) and in coming up with examples that exhibit separations 
between these parameters. However it has been less successful at providing general techniques that 
allow the determination (or even the approximation) of the broadcast rate (3 for large classes of 
problem instances. The following two facts starkly illustrate this limitation. First, the exact value 
of (3(G) remained unknown for every graph G except those for which trivial lower and upper bounds 
a(G),x(G) coincide. Second, it was not known whether the broadcast rate (3 could be approximated 
by a polynomial-time algorithm whose approximation ratio improves the trivial factor n (achieved 
by simply broadcasting all n messages) by more than a constant factor Q 

In this paper, we extend and apply the linear programming technique recently introduced in [9] 
to obtain a number of new results on Index Coding, including resolving both of the open questions 
stated in the preceding paragraph. The following two sections discuss our contributions, first to 
the general problem of broadcasting with side information, and then to the case when G is a graph. 

1.2 New techniques for bounding and approximating the broadcast rate 

The technical tool at the heart of our paper is a pair of linear programs whose values bound (3 above 
and below. The linear program that supplies the lower bound was introduced in |9j and discussed 
above; the one that supplies the upper bound is strikingly similar, and in fact the two linear 
programs fit into a hierarchy defined by progressively strengthening the constraint set (although 
the relevance of the middle levels of this hierarchy to Index Coding, if any, is unclear). 

Theorem 1. Let G be a broadcasting with side information problem, having n messages and m 
receivers. There is an explicit sequence of n information-theoretic linear programs, each one a 
relaxation of its successors, whose respective solutions b\ < hi < . . . < b n are such that: 

(i) The broadcast rate (3 satisfies bi < (3 <b n , and both of the inequalities can be strict. 

(ii) When G is a graph, the extreme LP solutions b\ and b n coincide with the independence number 
a(G) and the fractional clique-cover number Xf(G) respectively. 

As a first application of this tool, we obtain the following pair of algorithmic results. 
1 When G is a graph, it is not hard to derive a polynomial-time o(n)-approximation from (|1.2[|. 
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Capacities Best previous New separation Appears in 
compared bounds in graphs results Section 



P-a G(n - 56 ) G(n) 


2.4 






4.1 




Pl-P «0.32 G(n) 
Px/P « 1.32 1.5-o(l) 


2.4 
2.4 




P*-P G(n) 


2.4 





Table 1: New separation results for Index Coding capacities in n- vertex graphs 



Theorem 2. Let G be a broadcasting with side information problem, having n messages and m 
receivers. Then there is a polynomial time algorithm which computes a parameter r = t{G) such 
that 1 < < O(n lo f^°& n ) . There is also a polynomial time algorithm to decide whether P{G) = 2. 

In fact, the O(n l ° 1 g o 1 °^"' ) approximation holds in greater generality for the weighted case, where 
different messages may have different rates (in the motivating applications this can correspond e.g. 



to a server that holds files of varying size). The generalization is explained in Section 3.2 
1.3 Consequences for graphs 

In Section [5] we demonstrate the use of Theorem [T] to derive the exact value of P(G) for various 
families of graphs by analyzing the LP solution 62- As mentioned above, the exact value of P(G) 
was previously unknown for any graph except when the trivial lower and upper bounds — a(G) and 
x(G) — coincide, as happens for instance when G is a perfect graph. Using the stronger lower and 
upper bounds 62 and b n , we obtain the exact value of P(G) for all cycles and cycle-complements: 
P(C n ) = n/2 and P(C n ) = n/[^\. In particular this settles the Index Coding problem for the 5- 
cycle investigated in [4|[7j[9], closing the gap between 62 (C5) = 2.5 and P*(C*,) = 5 — log 2 5 2.68. 
These results also provide simple constructions of networks with gaps between vector and scalar 
Network Coding capacities. 

We also use Theorem[T]to prove separation between broadcast rates and other graph parameters. 
Our results, summarized in Table[TJ improve upon several of the best previously known separations. 
Prior to this work there were no known graphs G where Pi(G) — P(G) > 1. (For the more 
general setting of broadcasting with side information, multiplicative gaps that were logarithmic 
in the number of messages were established in pi.) In fact, merely showing that the 5-cycle 
satisfies 2 < P < Pi = 3 required the involved analysis of an auxiliary capacity P* , discussed 
earlier in Section |1.1| With the help of our linear programming bounds (Theorem [TJ we supply in 



Section 2.4 a family of graphs on n vertices where P\ — P is linear in n, namely p = n/2 whereas 
p x = (1 - I l g 2 5 - o(l))n rj 0.54n. 

We turn now to the relation between P{G) and x*(G), the upper bound provided by our LP 
hierarchy. As mentioned earlier, Lubetzky and Stav [23] supplied, for every e > 0, a family of 
graphs on n vertices satisfying P(G) < Pi(G) < n £ while Xf(G) > n 1_£ , thus implying that Xf(G) 
is not bounded above by any polynomial function of P{G). We strengthen this result by showing 
that Xf(G) is not bounded above by any function of P(G). To do so, we use a class of projective 
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Hadamard graphs due to Erdos and Renyi to prove the following theorem in Section [4. 1| 

Theorem 3. There exists an explicit family of graphs G on n vertices such that /3(G) = 3 whereas 
the Index Coding encoding schemes based on clique-covers cost at least Xf(G) = Q{n 1 ^) bits. 

Recall the natural heuristic approach to Index Coding: greedily cover the side-information graph 
G by r > x(G) cliques and send the XORs of messages per clique for an average communication 
cost of r. A similar protocol based on Reed-Solomon Erasure codes was proposed by [8] and was 
empirically shown to be effective on large random graphs. Theorem [3] thus presents a hard instance 
for this protocol, namely graphs where (3 = 0(1) whereas x(G) is polynomially large. 

2 Linear programs bounding the broadcast rate 

In this section we present linear programs that bound the broadcast rate f3 below and above, using 
an information-theoretic analysis. We demonstrate this technique by determining P{C§) precisely; 
later, in Section [5j we determine /3 precisely for various infinite families of graphs. 

2.1 The LP hierarchy 

Numerous results in Network Coding theory bound the Network Coding rate (e.g., [2| [l2|[l7|[l8|[26] ) 
by combining entropy inequalities of two types. The first is purely information-theoretic and holds 
for any set of random variables; the second is derived from the graph structure. An important 
example of the second type of inequality, that we refer to as "decoding" , enforces the following: if 
a set of edges A cuts off a set of edges B from all the sources, then any information on edges in B 
is determined by information on edges in A. We translate this idea to the setting of Index Coding 
in order to develop stronger lower bounds for the broadcast rate. 

Definition 2.1. Given a broadcasting with side information problem and subsets of messages A, B, 
we say that A decodes B (denoted A B) if A C B and for every message x £ B \ A there is a 
receiver Rj who is interested in x and knows only messages in A (i.e. XfU) = x and N(j) C A). 

Remark 2.2. For graphs, A ~~* B if A C B and for every v £ B \ A all the neighbors of v are in A. 

If we consider the Index Coding problem on G and a valid solution £ , then the relation A ^ B 
implies H(A, £ (xi, . . . , x n )) > H(B,£(xi, . . . ,x n )), since for each message in B \ A there is a 
receiver who must be able to determine the message from only the messages in A and the public 
channel £(x\, . . . ,x n ). (Here and in what follows we denote by H(X,Y) the joint entropy of the 
random variables X, Y.) Combining these decoding inequalities with purely information-theoretic 
inequalities, one can prove lower bounds on the entropy of the public channel, a process formalized 
by a linear program (that we denote by B2) whose solution 62 constitutes a lower bound on (3. 
(See (9}[28] for more on information-theoretic LPs.) Interestingly, B2 fits into a hierarchy of 



increasing linear programs such that the last LP in the hierarchy gives an upper bound on (3. 



Definition 2.3. For a broadcasting with side information problem on a set V of n messages, 
the /3 -bounding LP hierarchy is the sequence of LPs, denoted by B\, B2, B3, ■ ■ ■ , B n with solutions 
61,62, • • • ,6„, given by: 
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k-th level of the LP hierarchy for the broadcast rate 



minimize X{v>) 
subject to: 

X{V) > n 

X(0) > 

X{S) + \T\S\ >X(T) 
X{T) > X{S) 
X{A) > X(B) 

Etcr(-1) IRXTI X(TuZ)<0 



VS C T C V 

V4,BCF : A ~* B 
MR C V : 2 < \R\ < k 
VZ CV : ZC\R = ® 



{initialize) 
{non-negativity) 

{slope) 
{monotonicity) 
{decode) 

{\R\-th order submodularity) 



Remark 2.4. The above defined 2-th order submodularity inequalities are equivalent to the classical 
submodularity inequalities whereby X{S) + X{T) > X{S n T) + X{S U T) for all S, T. 

Theorem [T] traps (3 in the solution sequence of the above-defined hierarchy and characterizes its 
extreme values for graphs. The proofs of these results appear in Section |2.2[ and in what follows 
we first outline the arguments therein and the intuition behind them. 

As mentioned above, the parameter 62 is the entropy-based lower bound via Shannon inequal- 
ities that is commonly used in the Network Coding literature. To see that indeed j3 > 62 we 
interpret a solution to the broadcasting problem as a feasible primal solution to B2 via the assign- 
ment X{A) = H{A U £{xi, . . . , x n )). The proof that a{G) = 61(G) for graphs is similarly based on 
constructing a feasible primal solution to B\, this time via the assignment X{A) = \A\ +max{|/| : 
/ is an independent set disjoint from A}. (The existence of this primal solution justifies the in- 
equality b\ < a; the reverse inequality is an easy consequence of the decoding, initialization, and 
slope constraints.) 

To establish that (3{G) < b n {G) when G is a graph we will show that b n {G) = Xf(G), the 
fractional clique-cover number of G, while Xf(G) is an upper bound on (5. For a general broadcasting 
network G we will follow the same approach via an analog of Xf for hypergraphs. It turns out that 
there are two natural generalizations of cliques and clique-covers in the context of broadcasting 
with side information. 

Definition 2.5. A weak hyperclique of a broadcasting problem is a set of receivers J such that for 
every pair of distinct elements Ri, Rj £ J , f{i) belongs to N{j). A strong hyperclique is a subset of 
messages TCF such that for any receiver Rj that desires Xfu\ G T we have that T C N{j)U{f{j)}. 

A weak fractional hyperclique- cover is a function that assigns a non-negative weight to each 
weak hyperclique, such that for every receiver Rj, the total weight assigned to weak hypercliques 
containing Rj is at least 1. A strong fractional hyperclique- cover is defined the same way, except 
that the weights are assigned to strong hypercliques and the coverage requirement is applied to 
receivers rather than messages. In both cases, the size of the hyperclique-cover is defined to be the 
sum of all weights. 

Observe that if T is any set of messages and J is the set of all receivers desiring a message in 
T, then T is a strong hyperclique if and only if J is a weak hyperclique. However, it is not the 
case that every weak hyperclique can be obtained from a strong hyperclique T in this way. 
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Observe also that if J is a weak hyperclique and each of the messages ar/m (Rj G J) is a single 
scalar value in some field, then broadcasting the sum of those values provides sufficient information 
for each Rj G J to decode This provides an indication (though not a proof) that /3 is 

bounded above by the weak fractional hyperclique cover number. The proof of Theorem [T]JI]) in fact 
identifies b n as being equal to the strong fractional hyperclique-cover number, which is obviously 
greater than or equal to its weak counterpart. The role of the n th -order submodularity constraints 
is that they force the function F(S) = X(S) — \S\ to be a weighted coverage function. Using this 
representation of F it is not hard to extract a fractional set cover of V, and the sets in this covering 
are shown to be strong hypercliques using the decoding constraints. 

Finally, we will show that one can have (3 > bi using a construction based on the Vamos matroid 
following the approach used in [13] to separate the corresponding Network Coding parameters. As 
for showing that one can have /3 < b n , we will in fact show that one can have /3 < 63 < b n . 

We believe that the other parameters 63, ... , b n -\ have no relation to (3, e.g. as noted above we 
show that there is a broadcasting instance for which /3 < 63 and thus 63 is not a lower bound on f3. 

2.2 Proof of Theorem Q 



In this section we prove Theorem [T] via a series of claims. The main inequalities involving the 
broadcast rate (3 are shown in { 2.2.1| whereas the constructions demonstrating that these inequalities 



can be strict appear in i 2.2.2 



2.2.1 Bounding the broadcast rate via the LP hierarchy 

We begin by familiarizing ourselves with the framework of the LP-hierarchy through proving the 
following straightforward claim regarding the LP-solution b\ and the graph independence number. 

Claim 2.6. If G is a graph then the LP-solution b\ satisfies b\(G) = a{G). 

Proof. In order to show that b\{G) > a(G), let I be an independent set of maximal size in G. 
Now, V \ I V implies that X(V \ I) > X(V) > n is true for any feasible solution. Additionally, 
X(V \ I) < X(0) + \V\I\. Combining these together, we get A(0) > \V\ -\V\I\ = |/| = a{G). 
To prove b\{G) < a(G) we present a feasible solution to the primal attaining the value a(G), 

X(S) = \S\ + max{|/| : / is an independent set disjoint from S} , (2-1) 

We verify that the solution is feasible by checking that it satisfies all the constraints of B\ . The 
fact that X{V) = n implies the initialization constraint is satisfied. To prove the slope constraint, 
for S C T C V let I, J be maximum-cardinality independent sets disjoint from S, T respectively. 
Note that J itself is disjoint from S, implying | J| < \I\. Thus we have 

X(T) = \T\ + \J\ = \S\ + \T\S\ + \J\ <\S\ + \T\S\ + |/| = X(S) + \T\ S\. 

Note also that / \ T is an independent set disjoint from T, hence it satisfies \I \ T\ < \ J\. Thus 

X{T) = \T\ + \J\ > |T| + \I\T\ = \TUI\>\SUI\ = \S\ + |/| =X{S), 

which verifies monotonicity. Finally, to prove decoding let A, B be any vertex sets such that A B. 
Consider G\A, the induced subgraph of G on vertex set V \ A. Every vertex of B \ A is isolated 
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in G \ A, and consequently if / is a maximum-cardinality independent set disjoint from B, then 
I U (B \ A) is an independent set in G \ A. Therefore, 



X(A) > \A\ + |/| + \B\A\ = \B\ + \I\ = X(B) . U 

We next turn to showing that b 2 is a lower bound on the broadcast rate. 
Claim 2.7. The LP-solution b 2 satisfies b 2 (G) < j3(G). 

Proof. Let G be a broadcasting with side information problem with n messages V and m receivers. 
Consider the message P = S(x%, . . . , x n ) that we send on the public channel to achieve f3. Denote 
by H the entropy function normalized so that H(xi) = 1 for all i. This induces a function from the 
power set of V U P to 1Z where H(S) = \S\ for any subset of messages S and H(P) = (3. 

Now, let X(S) = H(S,P) for SCI/, We will show that X satisfies all the constraints of the 
LP £>2, implying X it is a feasible solution B 2 . 

First, X(V) > n since H(V,P) = H(V) and our normalization has H(V) = n. Non-negativity 
holds because H{P) > 0. The X{-) values satisfy monotonicity and submodularity because entropy 
does. Slope is implied by the fact that entropy is submodular (that is, H (S, P) + H(T \ S) > 
H(T, P)) together with our normalization. Finally, decoding is satisfied because the coding solution 
is valid: each receiver Rj can determine its sought information from N(j) and the public channel. 

This solution gives X($) = H(P) = /3 and since the LP is stated as a minimization problem it 
implies that (3 is an upper bound on its solution b 2 - ■ 

Next we prove that j3 < b n . We do this in three parts. First, for every instance G of the 
broadcasting with side information problem, we define a parameter Xf(G) be the minimum size 
of a strong fractional hyperclique-cover; this parameter specializes to the fractional clique-cover 
number when G is a graph. Next we show that (3 < Xf, an d finally we prove that Xf = b n . 

Claim 2.8. For any broadcasting problem with side information, G, we have (3(G) < Xf(G). 

Proof. Let C be the set of strong hypercliques in G = (V,E). If Xf < w then there is a finite 
collection of ordered pairs {(S, xg) : S £ C} where the xg's are positive rational numbers satisfying 

xs = w , and x$ > 1 for all x G V . 

SeC SedxeS 

Let q be a positive integer such that each of the numbers xs (S G C) is an integer multiple of 1/q. 
Set p = qw, noting that p is also a positive integer. Letting ys = qxs for every S G C, we have: 



ys = p , and ys > q for all x G V . (2-2) 



E 

SeC SeC-.xeS 



Replacing each pair (£*, ys) with ys copies of the pair (5, 1) if necessary, we can assume that ys = 1 
for every S. Similarly, replacing each S by a proper subset if necessary, we can assume that the 



inequality (2.2) is tight for every x. (Note that this step depends on the fact that the collection 
of strong hypercliques, C, is closed under taking subsets.) Altogether we have a sequence of sets 
Si, S 2 , . . . , S p , each of which is a strong hyperclique in G, such that every message occurs in exactly 
q of these sets. 
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From such a set system it is easy to construct an index code where every message has q bits (i.e. 
E = {0, l} 9 ) and the broadcast utilizes p bits (i.e. £p = {0, 1} P ). Indeed, for each message x G V 
let ji(x) < 32{x) < • • • < j q (x) denote the indices such that x G 5j for j G {ii(x), J2(%), ■ ■ ■ i3q{ x )}- 
If the bits of message x are denoted by b± (x), 62(^)5 • • • 1 ^(^O then for each 1 < i < p the i-th bit 
of the index code is computed by taking the sum (modulo 2) of all bits bk(z) such that z G Si and 
i = jk(z). Receiver R = (S,x) is able to decode the k th bit of x by taking the jk(x)-th bit of the 
index code and subtracting various bits belonging to other messages x' G Sj k r x ) . All of these bits are 
known to R since Sj k i x ) is a strong hyperclique containing x. This confirms that /3(G) < p/q = w, 
as desired. ■ 



It remains to characterize the extreme upper LP solution: 
Claim 2.9. The LP-solution b n satisfies b n {G) = Xf(G). 

Proof. The proof hinges on the fact that the entire set of constraints of B n gives a useful structural 
characterization of any feasible solution X. Once we have this structure it will be simple to infer 
the required result. 

Lemma 2.10. A vector X satisfies the slope constraint and the i-th order submodularity constraints 
for i G {2, ...,n} if and only if there exists a vector of non-negative numbers w(T), defined for 
every non-empty set of messages T, such that X(S) = \S\ + X^T-Tgs W (T) for all S C V. 

The proof of this fact is similar to a characterization of a weighted coverage function. While 
much of the proof is likely folklore, we include it in Section [6] for completeness. 

Given this fact we now prove that b n (G) > Xf{G) by showing that any solution X having the 



form stated in Lemma 2.10 is a fractional coloring of G. Thus, for the remainder of this subsection, 



X refers to a solution of B n having value b n (G) and w refers to the associated vector of non- negative 



numbers whose existence is guaranteed by Lemma |2, 10 



Fact 2.11. For every message x G V, Y1tbx w (T) = 1. 



To see this, observe that monotonicity and decoding imply that X(V\{x}) = X(V). Lemma 2.10 
implies that the right-hand-side is n while the left-hand-side is n — 1 + ^2 TBx w(T). 

Fact 2.12. For every receiver Rj, if x denotes then I]r- x6 TciV(j)u{i} w (^) = 1- 



Indeed, monotonicity and decoding imply that X(N(j) U{x}) = X(N(j)). Lemma 2.10 implies 
that the right side and left side differ by 1 — Er-nercjvwufi} W (T)- 

For a message x, let N(x) = f]j. x=x . N(j) be the intersection of the side information for every 



receiver who wants to know x. By combining Facts 2.11 and 2.12 we find that if w(T) is positive 



then T is contained in N(x) U {x} for every x in T. Thus, we can infer the following: 

Corollary 2.13. Ifw(T) > then the set of receivers desiring messages inT is a strong hyperclique. 

Now, to prove b n {G) < Xf(G) we show that if a vector w gives a feasible fractional coloring then 
X(S) = \S\ + Y1iT-t<£S W (T) is feasible for the LP B n . By the argument made in the proof of Claim 



2.8 we can assume without loss of generality that Y^tbu w (T) = 1 Vn G F. X has value equal to the 



fractional coloring because X(ft>) = Y2t w {T). Further, Lemma 2.10 implies that X satisfies the i-th 



order submodularity constraints and slope. It trivially satisfies initialization and non-negativity. 
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To show that X satisfies monotonicity it is sufficient to prove that X(S U {u}) > X(S) for all 
SQV,uEV\S. By definition, we have X(SU{u}) — X(S) = 1 — J2t-u<= t<£ s w (T)- Additionally, 
we know ^2t-ugt<£S w (T) — YIt-ugt w (T) = 1 5 where the last equality is because w is a fractional 
coloring. Finally, for the decoding constraints, it is sufficient to show that X(A) > X(AL) {x}) 
for A = N(j) where Rj is a receiver who desires x. By definition of X, X(A) — X(A U {x}) = 

T,T:x&TCN(j)u{x} w (. T ) ~ L Also > E T: x e T c N(j)u{x} W ( T ) = J2 T 3x w ( T ) = 1 because T with 
w(T) > is a strong hyperclique. ■ 

2.2.2 Strict lower and upper bounds for the broadcast rate 

Claim 2.14. There exists a broadcasting with side information instance G for which /3(G) < 63(G). 

Proof. The construction is an extremely simple instance with only three messages {a,b,c} and 
three receivers ({a}, b), ({b}, c), and ({c}, a). It is easy to see that a © 6, b © c is a valid solution, 
and thus /3 < 2. However, using the 3rd-order submodularity constraint we have that 

X(ab) + X(bc) + X(ac) + X(0) > X(abc) + X(a) + X(b) + X(c). 

Combining that with decoding inequalities 

X{a) > X(ab) , X(b) > X(bc) , X(c) > X(ac) , 

together with the initialization inequality X(abc) > 3 now gives us that 63 = X($) > 3. ■ 

2.3 The broadcast rate of the 5-cycle 

As stated in Theorem [TJ whenever the LP-solution b<i equals Xf we obtain that (5 is precisely this 
value, hence one may compute the broadcast rate (previously unknown for any graph) via a chain 
of entropy-inequalities. We will demonstrate this in Section [5] by determining /3 for several families 



of graphs, in particular for cycles and their complements (Theorem 5.1). These seemingly simple 



cases were previously studied in 41 7] yet their /3 values were unknown before this work. 



To give a flavor of the proof of Theorem 5.1 we provide a proof- by-picture for the broadcast rate 



of the 5-cycle (Figure [TJ , illustrating the intuition behind choosing the set of inequalities one may 
combine for an analytic lower bound on j3. The inequalities in Figure [l] establish that /3(C 5 ) > §, 
thus matching the upper bound /3(Cs) < X/(Cs) = §• 

We note that odd cycles on n > 5 vertices as well as their complements constitute the first 



examples for graphs where the independence number a is strictly smaller than /3. Corollary 2.15 
will further amplify the gap between these parameters. 

2.4 Corollaries for vector/scalar index codes 

Prior to this work and its companion paper [9] there was no known family of graphs where a 7^ (3, 
and one could conjecture that for long enough messages the broadcast rate in fact converges to the 
independence number, the largest set of receivers that are pairwise oblivious. We now have that 
the 5-cycle provides an example where a = 2 while (3 = |, however here the difference (3 — a < 1 
could potentially be attributed to integer-rounding, e.g. it could be that a = [f3\ . 

Such was also the case for the best known difference between the vector capacity j3 and the 
scalar capacity /3i. The best lower bound on f3\ — (3 in any graph was again attained by the 5-cycle 
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Figure 1: A proof-by-picture that /3(Cs) = |. Variables marked by highlighted subsets of vertices, 
e.g. the first submodularity application applies the LP constraint X({3,4,5}) + X({2, 3, 4}) > 
X({2,3,4,5}) + X({3,4}). Final outcome is a proof that /3(C 5 ) > X(0) with3X(0) + 5 > X(0) + 1O. 

where it was slightly less than ^, and again in the constrained setting of graph Index Coding we 
could conjecture that Pi = [~/3] . 

The following corollary of the above mentioned results refutes these suggestions by amplifying 
both these gaps to be linear in n. The separation between a and (5 was further strengthened in the 
companion paper j9], where we obtained a gap of a polynomial factor between these parameters. 

Corollary 2.15. There exists a family of graphs G on n vertices for which (3(G) = n/2 while 
a{G) = \n andl3i{G) = (1- \ log 2 5+o(l))n ra 0.54n. Moreover, we have f3*(G) = (l-o(l))/3i(G). 

To prove this result we will use the direct-sum capacity (3* . Recall that this capacity is defined 
to be P*(G) = limt_5.oo jPi{t ■ G) = inf t j(3i(t ■ G) where t ■ G denotes the disjoint union of t copies 
of G. This parameter satisfies (5 < f3* < f3\. Similarly we let G + H denote the disjoint union of 
the graphs G, H. We need the following simple lemma. 

Lemma 2.16. The parameters (3 and f3* are additive with respect to disjoint unions, that is for 
any two graphs G, H we have p{G + H) = (3(G) + (3(H) and (3*(G + H) = (3*{G) + (3*(H). 

Proof of lemma. The fact that f3* is additive w.r.t. disjoint unions follows immediately from the 
results of jij. Indeed, it was shown there that for any graph G on n vertices f3*(G) = log 2 Xf(£(G)) 
where £ = £(G) is an appropriate undirected Cayley graph on the group Z 2 . Furthermore, it was 
shown that £(G + H) = £(G)VC(-fT), where V denotes the OR- graph-product. It is well-known 
(see, e.g., |15[|21| ) that the fractional chromatic number is multiplicative w.r.t. this product, i.e. 
Xf(G\/H) = Xf(G)xf(H) for any two graphs G, H. Combining these statements we deduce that 

2 nG+H) = X/(£(G + H)) = Xf{(t{GWH)) = x/ (c(G)) X /(e(iO) = 2^( G )+w . 
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We shall now use this fact to show that f3 is additive. The inequality (3(G + H) < /3(G) + /3(H) 
follows from concatenating the codes for G and H and it remains to show a matching upper bound. 



As observed by 23 , the Index Coding problem for an n-vertex graph G with messages that are 
t bits long has an equivalent formulation as a problem on a graph with tn vertices and messages 
that are 1-bit long; denote this graph by Gt (formally this is the i-blow-up of G with independent 
sets, i.e. the graph on the vertex set V(G) x [t], where (it, i) and (v,j) are adjacent iff uv S E(G)). 
Under this notation (3t(G) = fi\{Gi). Notice that (G + H)t = Gt + Ht for any t and furthermore that 
s ■ Gt is a spanning subgraph of G s t for any s and t, in particular implying that /3\(s ■ Gt) > f3\(G s t). 

Fix e > and let t be a large enough integer such that /3(G + H) > /3t(G + H)/t — e. Further 
choose some large s such that (3*(Gt) > Pi(s ■ Gt)/s — e and (3* (Hi) > 0i(s ■ H t )/s — e. We now get 

P(G + H) + e > p x (G t + H t ) It > (3*(G t + H t )/t = P*(G t )/t + (3*(H t )/t , 

where the last inequality used the additivity of f3* . Since 

(3*(G t )/t > fa,{s ■ G t )/st - e > Px(G st )/st - e > (3(G) - e 

and an analogous statement holds for f3*(H t )/t, altogether we have (3(G + H) > (3(G) + (3(H) — 3e. 
Taking e — > completes the proof of the lemma. ■ 



Proof of Corollary 2.15[ Consider the family of graphs on n = 5k vertices given by G = k ■ C\ 



5- 



It was shown in [4j that f}*{C$) = 5 — log 2 5, which by definition implies that (3*(G) = (5 — log 2 5)k 
and f3\(G) = f3*(G) + o(k). At the same time, clearly a(G) = 2k and combining the fact that 



f3(C§) = | with Lemma 2.16 gives (3(G) = 5k/2 = n/2, as required. 



The above result showed that the difference between the broadcast rate f3 and the Index Coding 
scalar capacity f3\ can be linear in the number of messages. We now wish to use the gap between 
f3 and f3\ to infer a gap between the vector and scalar Network Coding capacities. 

Corollary 2.17. For any k > 1 there exists a Network Coding instance on 5k + 2 vertices where 
the ratio between the vector and scalar-linear capacities is precisely 1.2 while the ratio between the 
vector and scalar capacities converges to 1 — \ log 2 5 1.07 as k — > oo. 



Proof. It is well known (e.g. [25] ) that an n-vertex graph Index Coding instance G can be translated 
into a capacitated network H on 2n + 2 vertices via a reduction that preserves linear encoding. It 
thus suffices to bound the ratio of the corresponding Index Coding capacities. 



For k > 1 consider the graph G consisting of k disjoint 5-cycles. Corollary 2.15 established that 
/3(G) = 5k/2 whereas j3\(G) = (5 — log 2 5 + o(l))k where the o(l)-term tends to as k — > oo. At the 
same time, it was shown in [7] that the scalar-linear Index Coding capacity over GF(2) coincides 
with a parameter denoted by minrk 2 (G), and as observed in [23] this extends to any finite field F as 
follows: For a graph H = (V, E) we say that a matrix B indexed by V over F is a representation of 
H over F if it has nonzero diagonal entries (B uu ^ for all u £ V) whereas B uv = for any a/w 
such that uv ^ E. The smallest possible rank of such a matrix over F is denoted by minrki^-ff). 
For the 5-cycle we have minrkp(C5) < x(Cs) = 3 by the linear clique-cover encoding and this is 
tight by as mmrkf(C^) > [jS(Cs)] = 3. Finally, minrkp is clearly additive w.r.t. disjoint unions of 
graphs by its definition and thus minrk^G) = 3k as required. ■ 
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3 Approximating the Broadcast Rate 

This section is devoted to the proof of Theorem [2j on polynomial-time algorithms for approximating 
j3 and deciding whether (3 = 2. Working in the setting of a general broadcast network is somewhat 
delicate and we begin by sketching the arguments that will follow. 

In the simpler case of undirected graphs, a o(n)-approximation to (3 is implied by results of [5 , 10 



27 that together give a polynomial time procedure that finds either a small clique-cover or a large 



independent set (see Remark 3.1 ). To get an approximation for the general broadcasting problem we 
will apply a similar technique using analogues of independent sets and clique-covers that give lower 
and upper bounds respectively on the general broadcasting rate. The analogue of an independent 
set is an expanding sequence — a sequence of receivers where the i th receiver's desired message 
is unknown to receivers — 1. The clique-cover analogue is a weak fractional hyperclique- 



cover (see Definition 2.5). In the remainder of this section, whenever we refer to hypercliques or 
hyperclique-covers we always mean weak hypercliques and weak hyperclique-covers. 

We will prove that there is a polynomial time algorithm that outputs an expanding sequence of 
size k or reports a fractional hyperclique-cover of size O (fcn 1_1//fc ); the approximation follows by 
setting k appropriately. We will argue that either we can partition the graph and apply induction or 
else the side-information map is dense enough to deduce existence of a small fractional hyperclique- 
cover. The proof of the latter step deviates significantly from the techniques used for graphs, and 
seems interesting in its own right. We will give a simple procedure to randomly sample hypercliques 
and use it to produce a valid weight function for the hyperclique-cover by defining the weight of a 
hyperclique to be proportional to the probability it is sampled by the procedure. 

To prove the second part of Theorem[2]we will prove that a structure called an almost alternating 
cycle (AAC) constitutes a minimal obstruction to obtaining a broadcast rate of 2. The proof makes 
crucial use of Theorem [TJ calculating the parameter 62 for AAC's to prove that their broadcast rate 
is strictly greater than 2. Furthermore, the proof reduces finding an AAC to finding the transitive 
closure of a particular relation, which is polynomial time computable. 

3.1 Approximating the broadcast rate in general networks 

We now present a nontrivial approximation algorithm for /3 for a general network described by a 
hypergraph (that is, the most general framework where there are m>n receivers). 

Remark 3.1. In the setting of undirected graphs a slightly better approximation algorithm for (3 



is a consequence of a result of Boppana and Halldorsson |10|, following the work of Wigderson 27 



In 10 the authors showed an algorithm that finds either a "large" clique or a "large" independent 
set in a graph (where the size guarantee involves the Ramsey number estimate). A simple adaptation 
of this result (Proposition 2.1 in the Alon-Kahale [5] work on approximating a via the function) 
gives a polynomial-time algorithm for finding an independent set of size ifc(m) = max js : ( fc ^7 2 ) — 
m} in any graph satisfying x(G) > n/k + m. In particular, taking m = n/k with k = | logn we 
clearly have tk(m) > k for any sufficiently large n and obtain that either x(G) < 4n/logn or we 
can find an independent set of size ^ log n in polynomial-time. 

We use the following notation: the n message streams are identified with the elements of 
[n] = V. The data consisting of the pairs {(N(j), f(j))}J^i is our directed hypergraph instance. 
When referring to the hypergraph structure itself (rather than the corresponding index coding 
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problem) we will refer to elements of V as vertices and we will refer to pairs (iV(j'), f(j)) as directed 
hyperedges. For notational convenience, we denote S(j) = N(j) U {/ (j)}. 

An expanding sequence of size A; is a sequence of receivers j\ , . . . , jk such that 

f{u)t{JS{i) (3.1) 

i<e 

for 1 < £ < k. For a hypergraph G, let a(G) denote the maximum size of an expanding sequence. 
Lemma 3.2. Every hypergraph G satisfies the bound /3(G) > a(G). 

Proof. The proof is by contradiction. Let ji, . . . ,jk be an expanding sequence and suppose that 
there is an index code that achieves rate r < k. Let J = {ji, ■ ■ ■ ,jk}- For b = log 2 |S| we have 

|£|fc = 2 bk > 2 br > | Sp |_ 

Let us fix an element x* G X for every i {/(j) : j G </}, and define ^ to be the set of all 
x G S n that satisfy x« = x* for all i {/(j) : j G J}. The cardinality of ^ is |S| fc , so the 
Pigeonhole Principle implies that the function £, restricted to is not one-to-one. Suppose that 
x and y are two distinct elements of ^ such that £{x) = £(y). Let i be the smallest index such that 
x f(ji) ^ VfUi)- Denoting ji by j, we have Xk = Vk f° r all fc G N(j), because N(j) does not contain 
f(je) for any £ > i, and the components with indices ■ ■ ■ ,jk are the only components in 

which x and y differ. Consequently receiver j is unable to distinguish between message vectors x, y 
even after observing the broadcast message, which violates the condition that j must be able to 
decode message f(j). ■ 

Lemma 3.3. Let ipf(G) denote the minimum weight of a fractional weak hyperclique- cover of G. 
Every hypergraph G satisfies the bound [3(G) < tpf(G). 

Proof. The linear program defining ipf{G) has integer coefficients, so G has a fractional hyperclique 
cover of weight w = ?pf(G) in which the weight w(J) of every hyperclique J is a rational number. 
Assume we are given such a fractional hyperclique-cover, and choose an integer d such that w(J) is 
an integer multiple of 1/d for every J . Let C denote a multiset of hypercliques containing d • w(J) 
copies of J for every hyperclique J . Note that the the cardinality of C is d ■ w. 

For any hyperclique J, let f(J) denote the set [Jj £ j{f(j)}- For each i G [n], let C{ denote the 
sub- multiset of C consisting of all hypercliques J G C such that i G f{J). Fix a finite field F such 
that |F| > dw. Define S = ¥ d and Sp = Y d ' w . Let {Cp}jec be a basis for the dual vector space 
Ep and let {£,f} jed be a set of dual vectors in X* such that any d of these vectors constitute a 
basis for S*. (The existence of such a set of dual vectors is guaranteed by our choice of F with 
|F| > dw > d.) 

The encoding function is defined to be the unique linear function satisfying 

g(£(x 1 ,...,x n ))= Y, £f(^) 
ief(J) 

For each receiver j, if i = f(j), the set of dual vectors £f with j G J compose a basis of £*, hence 
to prove that j can decode message x\ it suffices to show that j can determine the value of £f(xi) 
whenever j G J . This holds because the public channel contains the value of Yleef(J) £f( x e)' an< ^ 
receiver j knows that value of £f(x£) for every £ ^ i in f(J) because £ G N(j). ■ 
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We now turn our attention to bounding the ratio ipf(G)/a(G) for a hypergraph G. Our goal 
is to show that this ratio is bounded by a function in o(n). To begin with, we need an analogue 
of the lemma that undirected graphs with small maximum degree have small fractional chromatic 
number. 

Lemma 3.4. If G is a hypergraph with n vertices, and d is a natural number such that for every 
receiver j, \S(j)\ + d> n, then ipj{G) < 4d + 2. 

Proof. Let us define a procedure for sampling a random subset T C [n] and a random hyperclique 
J as follows. Let n be a uniformly random permutation of [n + d], let i be the least index such 
that Tr(i + 1) > n, and let T be the set {tt(1), 7r(2) , . . . , vr(i)}. (If vr(l) > n then i = and T is the 
empty set.) Now let J be the set of all j such that f(j) G T C S'(j). (Note that J is indeed a 
hyperclique.) 

For any hyperclique J let p(J) denote the probability that J is sampled by this procedure and 
let w{J) = (4d + 2) ■ p{J). We claim that the weights w(-) define a fractional hyperclique-cover of 
G, or equivalently, that for every receiver j, IP(/(j) G T C S'(j)) > jgq^- Let t/(j) denote the set 
{/(j)} U ([n] \ %)) U ([n + d] \ [n]) . The event £ = {/(j) G T C S(j)} occurs if and only if, in the 
ordering of U(J) induced by tt, the first element of U(j) is f(j) and the next element belongs to 
[n + d] \ [n]. Thus, 

W£) = -1 d - 

1 ) \u(j)\ \u(j)\-r 

The bound P(£) > now follows from the fact that \U(j)\ < 2d+ 1. ■ 

Lemma 3.5. If G is a hypergraph and a(G) < k, then ipf(G) < 6k ■ n 1_1//fc . Moreover, there is 
a polynomial-time algorithm, whose input is a hypergraph G and a natural number k, that either 
outputs an expanding sequence of size k + 1 or reports (correctly) that i^f(G) < 6k ■ n 1_1 / fc . 

Proof. The proof is by induction on k. In the base case k = 1, either G itself is a hyperclique 
or there is some pair of receivers such that f(j) is not in S(j'). In that case, the sequence 
ji = 3 > }J2 = j is an expanding sequence of size 2. 

For the induction step, for each hyperedge j define the set D(j) = {f(j)} U ([n] \ S(j)) and 
let ji be a hyperedge such that |-D(j')| is maximum. 

If \D(Ji)\ < n 1-1/fe + 1, then the bound 



+ n 1_1 / fe > n is satisfied for every j and Lemma 3.4 implies that if^f(G) < 4n 1 ~ 1 / fe + 2 < 
g n i-i/A,^ Otherwise, partition the vertex set of G into V\ = [n] \ S(j\) and Vi = S(ji), and 
for i = 1,2 define Gi to be the hypergraph with vertex set Vi and edge set Ei consisting of all 
pairs (N(j) n Vi,f(j)) such that (N(j),f(j)) is a hyperedge of G with f(j) G V;. (We will call 
such a structure the induced sub-hypergraph of G on vertex set Vi.) If G\ contains an expanding 
sequence j2,i3, • • • , jk+i of size k, then the sequence Ji,j2, • • • ,jk+i is an expanding sequence of 
size k + 1 in G. (Moreover, if an algorithm efficiently finds the sequence j2, J3, • • • ,jk+i then it 
is easy to efficiently construct the sequence ji, . . . ,jk+i-) Otherwise, by the induction hypothesis, 
G\ has a fractional hyperclique-cover of weight at most 6(k — 1)| Vij 1 " 1 /^ -1 ) < 6(k — l)|yi|n _1 / fc . 
Continuing to process the induced sub-hypergraph on vertex set V2 in the same way, we arrive at 
a partition of [n\ into disjoint vertex sets W%, W2, ■ ■ ■ , Wi of cardinalities n\, . . . , rig, respectively, 
such that for 1 < i < £, the induced sub-hypergraph on Wi has a fractional clique-cover of weight 
at most 6(k — l)nin _1 / fe , and for i = £ the induced sub-hypergraph on Wj satisfies the hypothesis 
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of Lemma 3.4 with d = n 1 l l k and consequently has a fractional hyperclique-cover of weight at 



most Qn l 1 l k . The lemma follows by summing the weights of these hyperclique-covers. ■ 
Combining Lemmas 3.2 3.3, 3.5, we obtain the approximation algorithm asserted by Theorem [2] 



3.2 Extending the algorithm to networks with variable source rates 

The aforementioned approximation algorithm for (3 naturally extends to the setting where each 
source in the broadcast network has its own individual rate. Namely, the n message streams are 
identified with the elements of [n] = V, where message stream i has a rate rj, and the problem input 
consists of the vector (n,...,r„) and the pairs f(j))}JI =1 . Thus the input is a weighted 

directed hypergraph instance. An index code for a weighted hypergraph consists of the following: 

• Alphabets £p and Sj for 1 < i < n, 

• An encoding function £ : JlILi ^* ~~ ^ ^-Pj 

• Decoding functions Vj : £p x flieivy) ^* ~~ ^ ^70) • 
The encoding and decoding functions are required to satisfy 

T>j(£(ai, . . .,a n ),a N ^) = cr /(j) 

for all j = l,...,m and all (o"i, . . . , a n ) G TlT=i Here the notation <JN{j) denotes the tuple 
obtained from a complete n-tuple . . . , cr n ) by retaining only the components indexed by elements 
of N(j). An index code achieves rate r > if there exists a constant b > such that > 2 bri 
for 1 < i < n and |Sp| < 2 br . If so, we say that rate r is achievable. If G is a weighted hypergraph, 
we define (3{G) to be the infimum of the set of achievable rates. 

The first step in generalizing the proof given in the previous subsection to the case where the fj's 
are non- uniform is to properly extend the notions of hypercliques and expanding sequences. A weak 
fractional hyperclique cover of a weighted hypergraph will now assign a weight w{J) to every weak 



hyperclique J such that for every receiver j, YljBj w (^) — r f(j) ( c ^- Definition 2.5 corresponding 
to rf(j) = 1). As before, the weight of a fractional weak hyperclique-cover is given by J2j w (>?) 
and for a weighted hypergraph G we let i^f{G) denote the minimum weight of a fractional weak 



hyperclique-cover. An expanding sequence ji, ■ ■ ■ ,jk is defined as before (see Eq. 3.1) except now 
we associate such a sequence with the weight Yle=i r f(ji) an< ^ the quantity a(G) will denote the 
maximum weight of an expanding sequence (rather than the maximum cardinality). 

With these extended defintions, the proofs in the previous subsection carry unmodified to the 



weighted hypergraph setting with the single exception of Lemma 3.5 , where the assumption that the 
hypergraph is unweighted was essential to the proof. In what follows we will qualify an application 
of that lemma via a dyadic partition of the vertices of our weighted hypergraph according to their 
weights rj. 

Assume without loss of generality that < < 1 for every vertex i £ [n], and partition the 
vertex of set G into subsets V\, V2, ■ ■ ■ such that V s contains all vertices i such that 2~ s < Vi < 2 1_s . 
Let G s denote the induced hypergraph on vertex set V s . For each of the nonempty hypergraphs 



G s , run the algorithm in Lemma 3.5 for k = 1,2,... until the smallest value of k(s) for which an 



expanding sequence of size k(s) + 1 is not found. If G° s denotes the unweighted version of G s , then 
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we know that 

a{G s ) > 2- s a{G°) > 2" s A;(s) 

ipf(G s ) < 2 1 -%(G°) < 2~ s • 12fc(a)n 1_1/fcW . 

In addition, for each i £ V s the set of hyperedges containing i constitutes a hyperclique, which 
implies the trivial bound 

M°s)< < 2 1 - S |T4|. 

Combining these two upper bounds for ipf(G s ), we obtain an upper bound for ipf(G): 

oo oo 

tpf{G) < j2^a g s) < y, 2 ~ s ■ min { 12k ( s ) nl ~ 1/k(s) ^ 2 \ v *\} ■ ( 3 - 2 ) 

s=l s=l 



We define t(G) to be the right side of (3.2). We have described a polynomial-time algorithm to 
compute t(G) and have justified the relation ipf(G) < t(G), so it remains to show that t{G) / a{G) < 

cn ( ^y^^ j for some constant c. 

The bound t(G) < n follows immediately from the definition of r, so if a{G) > i^°f ogn there is 
nothing to prove. Assume henceforth that a{G) < J"^ , and define w to be the smallest integer 
such that 2 W ■ a(G) > 2l ^ gn - We have 

w oo 

r(G) < Y, 2 ' 3 ■ 12k (s)n^ 1/k{s) + Yl 21_S -I^l 

S=l 5=10+1 
W 

< l2nY 2 ' S Ks)n~ l/k[s) + 2~ w -n 

s=l 

< 12na(G) £ n -V*W + 2na(G) (^p) > (3-3) 



with the last line derived using the relations 2- s k(s) < a(G s ) < a{G) and 2~ w < a(G)( 21 g° gn ) . 
Applying once more the fact that 2~ s k(s) < a(G), we find that n~ l l k ^ < n -1 ^ 2 "'"^). Substitut- 
ing this bound into (3.3) and letting a denote a(G), we have 

4^ < 2n ( l ° gl0gn ) + 12n ( + n" 1 ^ + . . . + n -i/*-«) . 
a(G) \ logn / V / 

In the sum appearing on the right side, each term is the square of the one following it. It now 
easily follows that the final term in the sum is less than 1/2, so the entire sum is bounded above 
by twice its final term. Thus 

l&<2n +24n.n-V^. (3.4) 

a{G) V lo g n / 

Our choice of w ensures that 2 w a < ^Lgn hence J* -2 "" < n ~ log log n/ log n = (logn)" 1 . By 
substituting this bound into (3.4) we obtain 

r(G) < n /21oglogn 24 



a(G) \ logn logn/ 
as desired. 
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3.3 Proof of Theorem [2J, determining whether the broadcast rate equals 2 

Let G be an undirected graph with independence number a = 2. Clearly, if G is bipartite than 
x(G) = 2 and so (3(G) = 2 as well. Conversely, if G is not bipartite then it contains an odd cycle, 
the smallest of which is induced and has k > 5 vertices since the maximum clique in G is a(G) = 2. 
In particular, Theorem 5.1 implies that (3(G) > (3(Ck) = i k i 2 \ > 2. We thus conclude the following: 

Corollary 3.6. Let G be an undirected graph on n vertices whose complement G is nonempty. 
Then (3(G) = 2 if and only if G is bipartite. 



A polynomial time algorithm for determining whether (3 = 2 in undirected graphs follows as 



an immediate consequence of Corollary 3.6 However, for broadcasting with side information in 
general — or even for the special case of directed graphs (the main setting of |7,8|) — it is unclear 
whether such an algorithm exists. In this section we provide such an algorithm, accompanied by 
a characterization theorem that generalizes the above characterization for undirected graphs. To 



state our characterization we need the following definitions. As in Section 3.1 we use S(j) to denote 
the set N(j) U {/(j)}. Additionally, we introduce the notation T(j) to denote the complement of 
S(j) in the set of messages. When referring to the message desired by receiver Rj, we abbreviate 
Xf(j) to x(j). Henceforth, when referring to a hypergraph G = (V,E), we assume that for each 
edge j G E, the hypergraph structure specifies the vertex f(j) and both of the sets S(j),T(j). 

Definition 3.7. If G = (V, E) is a directed hypergraph and 5 is a set, a function F : V — > S is 
said to be G-compatible if for every edge j G E, there are two distinct elements t,u G S such that 
F maps every element of T(j) to t, and it maps f(j) to u. 

Definition 3.8. If G = (V,E) is a directed hypergraph, an almost alternating (2n+l)-cycle in 
G is a sequence of vertices v- n , v- n+ \, . . . , v n , and a sequence of edges jo, . . . ,j n , such that for 
i = 0, . . . , n, the vertex f(ji) is equal to vi- n and the set T(j{) contains Vi, as well as v i + \ if i < n. 

Theorem 3.9. For a directed hypergraph G the following are equivalent: 

(i) (3(G) = 2 

(ii) There exists a set S and a G-compatible function F : V — > S. 
(Hi) G contains no almost alternating cycles. 

Furthermore there is a polynomial-time algorithm to decide if these equivalent conditions hold. 



Proof. (|ij)=^(iii): The contrapositive statement says that if G contains an almost alternating cycle 
then (3(G) > 2. Let v— n , . . . , v n be the vertices of an almost alternating (2n + l)-cycle with edges 
jo, • • • ,j n - To prove (3(G) > 2 we manipulate entropy inequalities involving the random variables 
{xi : — n < i < n} and y, where Xi denotes the message associated to vertex Vi normalized to have 
entropy 1, and y denotes the public channel. For S C {y, X- n , . . . , x n }, let H (S) denote the entropy 
of the joint distribution of the random variables in S, and let H(S) denote H(S). Let Stj denote 
the set {xi, Xi + \, . . . , xj}. 

For < % < n — 1, we have 

H(y) + (2n - 2) > H({ Xi _ n , x h x i+1 }) = H({x u x i+1 }) = H(S i:i+1 ) , (3.5) 

where the second equation holds because receiver j, can decode message Xj_ n = x(ji) given the 
value y and the values x^ for k G N(ji). Using submodularity we have that for < j < n, 

H(S 0:j ) + H(S r . 1+1 ) > H(S : j+ i) + H( Xj ) = H(S : j+ i) + H($) = H(S 0:j+1 ) + 2n + 1 . (3.6) 
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Summing up (3.6) for j = 1, ... ,n — 1 and canceling terms that appear on both sides, we obtain 



n-1 



H(S j:j+1 ) > H{S ..n) + (n - l)(2n + 1) . (3.7) 

i=o 



Summing up (3.5 ) for i = 0, . . . , n — 1 and combining with ( |3.7[ ) we obtain 

nH(y) + n(2n - 2) > F(S :n) + (n - l)(2n + 1) . (3.8) 

Now, observe that 

iT(£ 0:n ) + n - 1 > H(x , a:n) > 5fa„) > #(0) = 2n + 1 . (3.9) 



Summing (3.8) and ( |3.9| ), we obtain 

ni?(y) + 2n 2 - n - 1 > 2n 2 + n 

and rearranging we get H{y) > 2 + n , from which it follows that /3(G) > 2 + nT 1 . 

(iii)=^([ii]): Define a binary relation jj on the vertex set V by specifying that v$w if there exists an 



edge j such that {v, w} C T(j). Let ~ denote the transitive closure of (j. Define F to be the quotient 
map from V to the set S of equivalence classes of ~. We need to check that F is G-compatible. 
For every edge j G i?, the definition of relation jj trivially implies that F maps all of T{j) to a 
single element of S. The fact that it maps f(j) to a different element of S is a consequence of 
the non-existence of almost alternating cycles. A relation f(j) ~ v for some v £ T(j) would imply 
the existence of a sequence vq, . . . , v n such that vq = f(j), v n = v, and v$Vi+i for i = 0, n — 1. 
If we choose for < i < n to be an edge such that T(ji) contains Vi,Vi+\ (such an edge exists 
because v^Vi + i) and we set j n = j and Vi- n = f(ji) for i = 0, . . . , n — 1, then the vertex sequence 
v_ n , ■ ■ ■ ,v n and edge sequence jo, . . . , j n constitute an almost alternating cycle in G. 

Computing the relation ~ and the function F, as well as testing that F is G-compatible, can 
easily be done in polynomial time, implying the final sentence of the theorem statement. 

([ii])=^([i]): If F : V — > S is G-compatible, we may compose F with a one-to-one mapping from 
S into a finite field F, to obtain a function (ft : V — > ¥ that is G-compatible. The public channel 
broadcasts two elements of F, namely: 

y = ^2 Xv 

v 
v 

Receiver Rj now decodes message x(j) as follows. Let c denote the unique element of F such that 
4>{v) = c for every v in T(j). Using the pair (y, z) from the public channel, Rj can form the linear 
combination 

cy - z = ^2[c - (p(v)]x v . 

V 

We know that every v G T(j) appears with coefficient zero in this sum. For every v € N(j), receiver 
Rj knows the value of x v and can consequently subtract off the term [c — <p(v)]x v from the sum. 
The only remaining term is [c — (p(x(j))]x(j). The coefficient c — 4>(x(j)) is nonzero, because (f) is 
G-compatible. Therefore Rj can decode x(j). ■ 



19 



4 The gap between the broadcast rate and clique cover numbers 



4.1 Separating the broadcast rate from the extreme LP solution b n 

In this section we prove Theorem [3] that shows a strong form of separation between j3 and its upper 
bound b n = Xf- Not only can we have a family of graphs where j3 = 0(1) while Xf is unbounded, 
but one can construct such a family where Xf grows polynomially fast with n. 

Proof of Theorem [3], The following family of graphs (up to a small modification) was introduced 



by Erdos and Renyi in 14 . Due to its close connection to the (Sylvester-)Hadamard matrices when 
the chosen field has characteristic 2 we refer to it as the projective-Hadamard graph H(¥ q ): 

1. Vertices are the non-self-orthogonal vectors in the 2-dimensional projective space over ¥ q . 

2. Two vertices are adjacent iff their corresponding vectors are non-orthogonal. 

Let q be a prime-power. We claim that the projective-Hadamard graph H(¥ q ) on n = n(q) 
vertices satisfies /3 = 3 while Xf = Q{n 1 ^). The latter is a well-known fact which appears for 
instance in |6| 24 . Showing that Xf ^ (1 — o(l))n 1 / 4 is straightforward and we include an argument 
establishing this for completeness. 

The fact that (3 > 3 follows from the fact that the standard basis vectors form an independent 
set of size 3. A matching upper bound will follow from the minrkF parameter defined in Section [2.4| 
Let F be some finite field and let t = minrk]p(G) be the length of the optimal linear encoding over 
¥ for the Index Coding problem of a graph G with messages taking values in F. Broadcasting 
^[~log 2 |F|] bits allows each receiver to recover his required message in F and so clearly (3 < I. It 
thus follows that [/3(G)] < minrkF(G) for any graph G and finite field F. 

Here, dealing with the projective-Hadamard graph H, let B be the Gram matrix over ¥ q of 
the vectors corresponding to the vertices of H. By definition the diagonal entries are nonzero and 
whenever two vertices u, v are nonadjacent we have B uv = 0. In particular B is a representation 
for H over ¥ q which clearly has rank 3 as the standard basis vectors span its entire row space. 
Altogether we deduce that P(H) = 3 whereas Xf = @(n 1//4 ), as required. 

The fact that > (1 — o(l))n _1//4 will follow from a straightforward calculation showing that 
the clique-number of H is at most (1 + o(l))q 3//2 = (1 + o(l))n 3//4 . 

Consider the following multi-graph G which consists of the entire projective space: 

1. Vertices are all vectors of the 2-dimensional projective space over ¥ q . 

2. Two (possibly equal) vertices are adjacent iff their corresponding vectors are orthogonal. 

Clearly, G contains the complement of the Hadamard graph H (¥ q ) as an induced subgraph and it 
suffices to show that a{G) < (1 + o(l))q 3 / 2 . 

It is well-known (and easy) that G has N = q 2 + q + 1 vertices and that every vertex of G is 
adjacent to precisely q + 1 others. Further observe that for any u, v £ V(G) precisely one vertex of 
G belongs to {u, v} (as u, v are linearly independent vectors). In other words, the codegree of any 
two vertices in G is 1. We conclude that G is a strongly- regular graph (see e.g. [16] for more details 
on this special class of graphs) with codegree parameters fj, = v = 1 (where /U is the codegree of 
adjacent pairs and v is the codegree of non-adjacent ones). There are thus precisely 2 nontrivial 
eigenvalues of G given by ^((/i — v) ± \/ — v) 2 + 4(g + 1 — u)) = ±^/q, and in particular the 
smallest eigenvalue is Xm = —yfq- Hoffman's eigenvalue bound (stating that a < ^™^ m for any 
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regular m-vertex graph with largest and smallest eigenvalues Ai, X m resp., see e.g. [16]) now shows 

(n\ <r ~ NXn (g!+l±lVg 3/2 , , r 
a{G)<——— — = v = q l +q + y/q, 

(q+l)-X N q-y/q+1 
as required. ■ 

In addition to demonstrating a large gap between Xf and (5 on the projective-Hadamard graphs, 
we show that even in the extreme cases where G is a triangle- free graph on n vertices, in which 
case Xf{G) > n/2, one can construct Index Coding schemes that significantly outperform Xf- We 
prove this in Section 4.2 by providing a family of triangle-free graphs on n vertices where j3 < |n. 



4.2 Broadcast rates for triangle-free graphs 

In this section we study the behavior of the broadcast rate for triangle- free graphs, where the upper 
bound b n on (3 is at least n/2. The first question in this respect is whether possibly /3 = b n in this 
regime, i.e. for such sparse graphs one cannot improve upon the fractional clique-cover approach 
for broadcasting. This is answered by the following result. 

Theorem 4.1. There exists an explicit family of triangle-free graphs on n vertices where Xf > n /2 
whereas the broadcast rate satisfies /3 < |n. 

The following lemma will be the main ingredient in the construction: 

Lemma 4.2. For arbitrarily large integers n there exists a family J- of subsets of [n] whose size 
is at least 8n/3 and has the following two properties: (i) Every A G T has an odd cardinality, 
(ii) There are no distinct A,B,C £ J 7 that have pairwise odd cardinalities of intersections. 

Remark 4.3. For n even, a simple family T of size 2n with the above properties is obtained by 
taking all the singletons and all their complements. However, for our application here it is crucial 
to obtain a family T of size strictly larger than 2n. 

Remark 4.4. The above lemma may be viewed as a higher-dimensional analogue of the Odd- 
Town theorem: If we consider a graph on the odd subsets with edges between those with an odd 
cardinality of intersection, the original theorem looks for a maximum independent set while the 
lemma above looks for a maximum triangle-free graph. 

Proof of lemma. It suffices to prove the lemma for n = 6 by super-additivity (we can partition a 
ground-set [N] with N = 6m into disjoint 6-tuples and from each take the original family J 7 ). 

Let U\ = {{x} : x G [5]} be all singletons except the last, and Ui = {^4u{6} : A C [5] , \A\ = 2}. 
Clearly all subsets given here are odd. 

We first claim that there are no triangles on the graph induced on Ui- Indeed, since all subsets 
there contain the element 6, two vertices in Ui are adjacent iff their corresponding 2-element subsets 
A, A' are disjoint, and there cannot be 3 disjoint 2-element subsets of [5]. 

The vertices of U\ form an independent set in the graph, hence the only remaining option for a 
triangle in the induced subgraph on U\ U U2 is of the form {x}, (A U {6}), (A' U {6}). However, to 
support edges from {x} to the two sets in U2 we must have that x belongs to both sets, and since 
x ^ 6 by definition we must have x G A Pi A'. However, we must also have A n A' = for the two 
vertices in U2 to be adjacent, contradiction. 
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To conclude the proof observe that adding the extra set [5] does not introduce any triangles, 
since U\ is an independent set while [5] is not adjacent to any vertex in U2 (its intersection with any 
set (A U {6}) G U2 contains precisely 2 elements). Altogether we have | F\ = 5 + (2) + 1 = |n. ■ 

Proof of Theorem I4.1L Let T be the family provided by the above lemma and consider the graph 
G whose ./V vertices are the elements of J- with edges between A, B whose cardinality of intersection 
is odd. By definition the graph G is triangle-free and we have Xf(G) > N/2. 

Next, consider the binary matrix M indexed by the vertices of G where Mab = \A D B\ 
(mod 2). All the diagonal entries of M equal 1 by the fact that T is comprised of odd subsets only, 
and clearly M is a representation of G over GF(2). At the same time, M can be written as FF T 
where F is the N x n incidence-matrix of the ground-set [n] and subsets of T . In particular we 
have that rank(M) < rank(i ? ) < n over GF(2). This implies that minrk^G) < n and the proof is 
now concluded by the fact that /3(G) < minrk^G). ■ 



Remark 4.5. The construction of the family of subsets T in Lemma 4.2 relied on a triangle-free 
15-vertex base graph H which is equivalent to the Peterson graph with 5 extra vertices added to 
it, each one adjacent to one of the independent sets of size 4 in the Peterson graph. 

Having discussed the relation between (3 and b n for sparse graphs we now turn our attention to 
the analogous question for the other extreme end, namely whether (3 = b\ when b\ = a attains its 
smallest possible value (other than in the complete graph) of 2. 

4.3 Graphs with a broadcast rate of nearly 2 

We now return to the setting of undirected graphs, where the class of {G : (3(G) = 2} is simply the 
complements of nonempty bipartite graphs, where in particular Index Coding is trivial. It turns 
out that extending this class to {G : (3(G) < 2 + e} for any fixed small e > already turns this 
family of graphs to a much richer one, as the following simple corollary of Theorem [T] shows. Recall 
that the Kneser graph with parameters (n, k) is the graph whose vertices are all the /c-element 
subsets of [n] where two vertices are adjacent iff their two corresponding subsets are disjoint. 

Corollary 4.6. Fix < e < ^ and let G be the complement of the Kneser(n, k) graph on N = (?) 
vertices for n = (2 + e)k. Then (3(G) < 2 + e whereas x(G) > (e/2) log N. 

Proof. Using topological methods, Lovasz [22] proved that the Kneser graph with parameters (n, k) 
has chromatic number n — 2k + 2, in our case giving that x(G) = ek + 2 < (e/2) log N (with the 
last inequality due to the fact that > [e(2 + e)] fc and so k > \ log A). At the same time, it is well 
known that G satisfies Xf = n /k (its maximum clique corresponds to a maximum set of intersecting 
A;-subsets, which has size u = (tZi) by the Erdos-Ko-Rado Theorem, and being vertex-transitive 
it satisfies X/ = N/oS). The bound (3 < b n =Xf given in Theorem [l] thus completes the proof. ■ 

5 Establishing the exact broadcast rate for families of graphs 
5.1 The broadcast rate of cycles and their complements 

The following theorem establishes the value of (3 for cycles and their complements via the LP 
framework of Theorem [TJ 
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Theorem 5.1. For any integer n > 4 the n-cycle satisfies (3(C n 
satisfies f3(C n ) = n/\n/2\ . In both cases j3\ = while a = \J3\. 



n/2 whereas its complement 



Proof. As the case of n even is trivial with all the inequalities in (1.2) collapsing into an equality 
(which is the case for any perfect graph), assume henceforth that n is odd. We first show that 
P(C n ) = n/2. Putting n = 2k + 1 for k > 2, we aim to prove that 62 > k + 1/2, which according 
to Theorem [l] will imply the required result since clearly Xf = k + 1/2. 

Denote the vertices V of the cycle by 0, 1, ... , 2k. Further define: 

(Evens) , 
(Odds) , 
(Evens decoded) , 
(Odds decoded) , 
(Middle) . 



E 


= {i 


i = mod 2 


i^2k} 





= {i 


i = 1 mod 2} 




E+ 


= {i 


i < 2k - 2} 




+ 


= {i 


1 < i < 2k - 


1} 


M 


= {i 


\<i<2k- 


2} 



Next, consider the following constraints in the LP 82- 

X(Q) + k> X{E) 
X(Q) + k > X(0) 
X($) + l>X{{2k}) 
X(E) > X(E + ) 

x(p) > x(o + ) 

X{E + ) + X{0 + ) > X(V) + X(M) 
X(M) + X({2k}) > X(V) + X($) 
2X(V) > 2(2fc + l) 



(slope) 
(slope) 
(slope) 
(decode) 
(decode) 
(submod , decode) 
(submod , decode) 
(initialize) . 



Summing and canceling we get 2X(0) + 2k + 1 > 4k + 2, implying X($) > k + 1/2. The main 
idea of this proof, as with the ones to follow, is that we input some sets of vertices and then apply 
decoding to the sets as well as combine them together using submodularity to eventually output 
X(V) and X(0). 

It remains to treat complements of odd cycles. Let H = AAC n be the complement of a directed 



odd almost-alternating cycle on n vertices (as defined in Section 3.3). Treating C n as a directed 
graph (replacing each edge with a bi-directed pair of edges) it is clearly a spanning subgraph of H, 
hence (3(C n ) is at least as large as (3(H). The proof in Section 



3.3 



establishes that /3(H) > 



_ K2J ' 

translating to a lower bound on (3(C n ). The matching upper bound follows from the fact that due 
to Theorem jlj we have (3(C n ) < x(C n ) = ^ n " 2 j ■ ' 

5.2 The broadcast rate of cyclic Cay ley Graphs 



In this section we demonstrate how the same framework of the proof of Theorem 5.1 may be applied 
with a considerably more involved sequence of entropy-inequalities to establish the broadcast rate of 
two classes of Cayley graphs of the cyclic group Z n . Recall that a cyclic Cayley graph on n vertices 
with a set of generators G C {1,2,..., \n/2\ } is the graph on the vertex set {0, 1, 2, . . . , n — 1} 
where is an edge iff j — i = g (mod n) for some g G G. 
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Theorem 5.2. For any n>4, the 3-regular Cayley graph ofL n has broadcast rate f3 = n/2. 

Theorem 5.3 (Circulant graphs). For any integers n > 4 and k < , the Cayley graph of Z„ 
with generators {±1, . . . , ±k} has broadcast rate /3 = n/(k + 1). 

To simplify the exposition of the proofs of these theorems we make use of the following definition. 

Definition 5.4. A slice of size % in 2^ indexed, by x is the subset of i contiguous vertices on the 
cycle given by {x + j (mod n) : < j < i}. 

Proof of Theorem 15.21 It is not hard to see that for a cyclic Cayley graph to be 3-regular it 
must have two generators, 1 and n/2, and n must be even. If n is not divisible by four, then it is 
easy to check that there is an independent set of size n/2 and Xf 1S a ^ so n/2. Thus, it immediately 
follows that /3 = n/2. For 3-regular cyclic Cayley graphs where n is divisible by four, a is strictly 
less than n/2. So to prove that (3 = n/2 we use the LP Bi to show 62 > n/2, implying j3 > n/2. 

Let 0, 1, 2, . . . , 4/c — 1 be the vertex set of the graph. We assume that any solution X has cyclic 
symmetry. That is, X(S) = X({s + i\s € S}) for all i G [0,4k— 1]. This assumption is without loss 
of generality because we can take any LP solution X and find a new one X' that is symmetric and 
has the same value by setting X'(S) = jr Yli=o X({s + i\s £ S}). All the constraints are feasible 
for X' because each is simply the average of 4k feasible constraints. 

In our proof we will be using the following subsets of vertices: 
[i] = {0, 1, 2, . . . , i — 1} (a slice of size i) 

D = {0, 2, . . . , 2k - 4, 2k - 2, 2k + 1, 2k + 3, . . . , Ak - 5, Ak - 3} 
D + = {0, 1, 2, . . . , 2k - 4, 2k - 3, 2A; - 2, 2k + 1, 2k + 2, 2k + 3, . . . , 4k - 4, 4k - 3} . 

Observe from Figure [2] that D D + . Also note that D + is missing only four vertices, two on 
each side almost directly across from each other, and \D\ = 2k — 1. 

Similar to our proof for the 5-cycle, we will prove 62 > n/2 by listing a sequence of constraints 
in the LP B2 that sum and cancel to give us A"(0) > n/2. However, this proof differs from the 
5-cycle proof because we list inequalities implied not only by the constraints in our LP but also 
our assumption of cyclic symmetry. The fact that any two slices of size i have the same X value is 
used heavily in the sequence of inequalities that make up our proof. 
First, we create 2k — 1 X(D + ) terms on the right-hand-side: 

(2k - 2) + X(0) > X(D \ {0}) (slope) 
X([l}) + X(D \ {0}) > X(D + ) + X{$) (submod , decode) 

(2k - 2)((2k - 1) + X($) > X(D + )) (slope , decode) 

Now, we apply submodularity to slices of size i = 2 . . . 2k and an X(D + ) term — canceling all the 
X(D + ) terms we created on the right-hand-side in the previous step. We pick our slices so that 
the union is a slice missing only two vertices, and the intersection is a slice of size i — 1. 

X(D + ) + X([2k\) > X([4k - 2]) + X([2k - 1]) 
X(D + ) + X([2k - 1]) > X([4k - 2]) + X([2k - 2]) 

X(D+)+X([2])>X([4k-2]) + X([l}) 
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4k-5 



2k + 




2k+l 



Figure 2: A 3-regular cyclic Cay ley graph on 4k vertices. Highlighted vertices mark the set D used 
in the proof of Theorem |5.2[ 



If we sum and cancel the inequalities listed so far we have: 

2k{2k - 2) + {2k - 2)X(0) + X([2k\) > (2k - l)X([4k - 2]) 

Now, we combine all 2k — 1 of the X([4k — 2]) terms to get full cycles. 

2X([4k - 2]) > X(V) + X([4k - 3]) 
X([4k - 3]) + X([4k - 2]) > X(V) + X([4k - 4]) 
X([4k - 4]) + X{[4k - 2]) > X(V) + X([Ak - 5]) 

X([2k + 1]) + X(H[4k - 2]) > H(V) + H([2k}) 
Now, we are left with: 

2k(2k - 2) + (2k - 2)X(0) > (2k - 2)X(V) 
We can apply the constraint X(V) > n, yielding: 

2k(2k - 2) + (2k - 2)X(%) > (2k - 2)4k 

thus X($) > 2k for any feasible solution, implying b2>2k = n/2. ■ 

Proof of Theorem ES It is easy to check that Xf f° r these graphs is n/(k + 1), so it is sufficient 



to prove that 62 > n/(k + 1). As we did in the proof of Theorem 5.2 we will assume that our 
solution X has cyclic symmetry. Suppose that n mod (k + 1) = j. Now, consider dividing the 
cycle into sections of size k + 1 and let <S be the set of vertices consisting of the first k in each 
complete section (l^l = k(n — j) / (k + 1)). Then by decoding X(S) = X([—j}) where [—j] is a slice 



of size n — j. We will also use [j] to denote a set of size j, as in the proof of Theorem 5.2 Observe 
that if j = then this observation alone completes the proof. 
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Lemma 5.5. (k + l)X[-j] + X[k] > (k + l)[-j - 1] + X{%) 



Proof. The following inequalities are true by submodularity and the cyclic symmetry of X. In each 
inequality we apply submodularity to two slices, say of size s and t, s < t, overlapping such that 
their intersection is a slice of size s — 1 and their union a slice of size t + 1. 

x{[-j}) + X([-j\) > X([-j + 1]) + X([-j - 1]) 
X([-j]) + X([-j - 1]) > X([-j + 1]) + X([-j - 2]) 
X([-j]) + X([-j - 2]) > X([-j + 1]) + X([-j - 3]) 



X({-j}) + X({-j -(k- 1)]) > X([-j + 1]) + X([-j 
X([-j - k}) + X([k]) > X(0) + X([-j + 1]) 



*]) 



(submod , decode). 



Adding up all of these inequalities gives us the desired inequality. 



Now, if we sum together the following string of inequalities we get the bound we want on X( 
Essentially, we iteratively apply our Lemma to get us to the trivial j = case. 



k(n - j) + (k + 1)X(0) > (k + l)X([-j]) 

jk + jX(<D)>jX([k]) 

(k + l)X([-j}) + X([k}) >(k + l)X([-j - 1]) + X( 
(k + l)X([-j - 1]) + X([k}) >(k + l)X([-j - 2]) + X( 



(slope , decode) 
(slope) 



(by Lemma 5.5) 



(by Lemma 5.5) 



(by Lemma 5.5) 



(k + 1)X([-1]) + X([k}) >(k + l)X(V) + X(0) 
(k + l)X(V) > kn. 

This completes the proof. ■ 
5.3 The broadcast rate of specific small graphs 

For any specific graph one can attempt to solve the second level of the LP-hierarchy directly to 
yield a possibly tight lower bound f3 > b<i- The following corollary lists a few examples obtained 
using an AMPL/CPLEX solver. 

Fact 5.6. The following graphs satisfy 62 = /3 = \f : 

(1) Petersen graph (Kneser graph on vertices): n = 10, a = 4 and (3 = 5. 

(2) Grotzsch graph (smallest triangle-free graph with x = 4j: n = 11, a = 5 and (3 = 

(3) Chvatal graph (smallest triangle-free 4-regular graph with x = 4j: n = 12, a = 4 and (3 = 6. 



6 Coverage functions: a proof of Lemma 



2.10 



Lemma 2.10 (§ 2.2) will readily follow from establishing the following Lemmas 6.1 and 6.2 as it 
is easy to verify that the slope constraints and the i-th order submodularity constraints in our LP 



are equivalent to the inequalities in Eq. (6.1). 
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Lemma 6.1. A vector X , indexed over all subsets of the groundset V , satisfies 



vfl/0,vzni? = 0, ^2(-i)\ r \ t \x(tuz) < 



TCR 



1 if\R\ = l 
otherwise 



if and only if it satisfies: 

VR C V, R + 0, Yl (- 1 )' T| ( X ( fi \ T ) ~ \ R \ T D ^ • 



(6.1) 



(6.2) 



TCR 



Lemma 6.2. A vector X, indexed over all subsets of the ground-set V , satisfies (6.2) if and only 



if there exists a vector of non-negative numbers w(T), defined for every non-empty vertex set T , 
such that X(S) = \S\ + J2t-.tcs w ( T ) yS £ V - 



Proof of Lemma 6.1, First, we claim that X satisfies (6.2) if and only if it satisfies: 

1 if|J2| 

— X(l ) < < 

TCR ^ 



1, 

otherwise. 



(6.3) 



Starting with the inequalities (6.2), observe that we get an equivalent set of inequalities when 



we switch the roles of T and R\T, as it is essentially summing over the complements of T instead of 
T. Additionally, for > 2 we can remove the constant term because it is equal to the alternating 
sum ± ^2i = i(— l) fc ('^')fc = 0. If = 1 then the constant term is one. 



Now, we show the equivalence of (6.3) and (6.1). Clearly, if X satisfies (6.1) then it satisfies 



(6.3) because the inequalities in the latter are a subset of the inequalities in the former. Now, we 



show by induction on the size of Z that (6.3) implies (6.1). Our base case, \Z\ = holds trivially. 



We assume that (6.3) implies (6.1) for \Z\ < \Z*\ and show the following inequality holds: 



(-l) lR ^ T] X(TUZ*) < 



TCR* 



1 if \R\ = 1 
otherwise 



By our inductive hypothesis, Eq. (6.3) implies the following two inequalities from (6.1): 



R = R* U {z} , Z = Z* \ {z} , 
R = R* , Z = Z*\{z} 

for some z G Z* . It is easy to see that (*) — (II) = (I) , thus we can derive (*) from (I) , (II) . 



(*) 



(I) 
(II) 



Proof of Lemma 6.2. Suppose there exists a vector of non- negative numbers w(T), defined for every 
non-empty vertex set T, such that X(S) = \S\ + X/T-Tgs W C-^) ^ ^ ^ as in the statement of our 
Lemma. Then rearranging, we have: 



X(S) -\S\= ™( T ) = E w ^ ySQV 

T-.TCS T:TnS^0 



Now, define F{S) = X{S) -\S\. 
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Lemma 6.3. The set function F satisfies 

VRCV,R^tt, ^(-l) |r| F(fiUT)<0. (6.4) 

TCR 

if and only if there exists a vector of non-negative numbers w(T), defined for every non-empty 
vertex set T , such that 

F{S) = W ( T ) yS ^ V - ( 6 - 5 ) 



Remark 6.4. A set function F is called a weighted set cover function if it can be written as in 



Eq. (6.5). 



Plugging in X{S) — \ S\ for F(S) and noting that RUT = R\T it is easy to see that Lemma 6.3 



implies our desired result. Thus, it remains to prove Lemma 6.3 



Proof of Lemma 6.3, In this proof we will be working with vectors and matrices whose rows and 
columns are indexed by subsets of V. Let n = |V|,JV = 2 n . Expressing F and w as vectors with 
N — 1 components (ignoring the component corresponding to the empty set), this equation can be 
written in matrix form as 

F = Mw, 



where M is the (N — l)-by-(iV — 1) matrix defined by 

M TS -- 



1 if T n S ^ 
otherwise. 



We shall see below that M is invertible. It follows that F can be written as in Eq. (6.5) if and only 
if M~ 1 F is a vector w with non- negative components. 

To prove that M is invertible and to obtain a formula for the entries of the inverse matrix, let 
L be the N-by-N matrix defined by 



Lts 



1 if t n S + 
otherwise. 



In other words, L is the matrix obtained from M by adding a top row and a left column consisting 
entirely of zeros. Let us define another matrix K by 



Kts = 1 — Lts 



1 if T n S = 
otherwise. 



We can now begin to make progress on inverting these matrices, using the observation that both 
K and K + L can be represented as Kronecker products of 2-by-2 matrices. Specifically, 

K "< J) ■ K+L =(\ I 
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The inverse of ( } q ) is ( ? 3l ) • We may now make use of the fact that Kronecker products commute 
with matrix products, to deduce that 



L 



{K + L) 



1 1 
1 1 

1 
1 



(8>n 



1 

-1 
1 

1 



K 



(gin 



1 

-1 
1 1 

1 



Examine the matrices occurring on the left and right sides of the equation above, and consider the 
submatrix obtained by deleting the left column and top row. On the right side, we obtain —I, 
where / denotes the (N — l)-by-(iV — 1) identity matrix. On the left side we obtain M ■ A, where A 
is the matrix obtained from ( ° -i ) ^ n °y deleting the left column and top row. This implies that 
M is invertible and its inverse is —A. Moreover, one can verify that the entries of —A are given by 



(O ifTuS^V 
~ TS ~ \(-l)\TnS\ ■ liTuS = V. 



Recall that a set function F can be expressed as it is in Eq. (6.5) if and only if M~ l F has 
non- negative entries. Now that we have derived an expression for M~ l we find that this criterion 
is equivalent to stating that for all nonempty sets R QV, 



£ ( 

S : TUS=V 



-l)\Tns\ F ( S) < 



This condition is equivalent to Eq. (6.2) because every set S such that TU S = V can be uniquely 



written as the disjoint union of two sets T and R = T D S. This completes the proof of Lemma 6.3 



and subsequently proves Lemmas 6.2 and 2.10 



7 Open problems 

• We provide an information-theoretic lower bound 62 on the broadcast rate /3, enabling us to 
answer fundamental questions about the behavior of /3. While one can have 62 < P, what is 
the largest possible gap between the two parameters? Recalling that the linear program for 
62 contains exponentially many constraints, is there an efficient algorithm for computing 62? 

• Our results include a polynomial time algorithm for determining whether /3 = 2 for any 
broadcasting network. A major open problem is establishing the hardness of determining 
whether j3 < C for a given graph G and real C > 0. While no such hardness result is known, 
presumably this problem is extremely difficult e.g. it is unclear whether it is even decidable. 

• In an effort to approximate /3, we give an efficient multiplicative o(ra)-approximation algorithm 
for the general broadcasting problem. Can we obtain an approximation of f3 (even for case of 
undirected graphs) within a multiplicative constant of n l ~ e for some fixed e > 0? 

• Using certain projective-Hadamard graphs introduced by Erdos and Renyi, we show that the 
broadcast rate can be uniformly bounded while its upper bound b n is polynomially large. Is 
the scalar capacity fi\ of these graphs unbounded as the field characteristic q tends to 00? 
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